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(54) Title: MODIFYING PROTEIN IMMUNOGENICS 
(57) Abstract 

It is known mat proteins, or parts of mem, may be rendered non- or less immunogenic to humans or other species by identifying 
one or more potential T cell epitopes and eliminating mem by amino acid modification. Conventionally, certain epitopes may be retained 
in a protein sequence if the peptides constituting such epitopes are present in endogenous human protein, since they would be recognised 
as "self 1 . However, it has now been found that even self epitopes may give rise to immune reactions. The invention provides for their 
elimination, for example by recombinant DNA technology, to render mem more useful for administration to humans, for example for 
therapeutic or diagnostic purposes. r 



<D 

CO 



Q 
CX 

CD 

o 

o 

TD 



FOR THE PURPOSES OP INFORMATION ONLY 



Codes used to identify States party to the PCT on die front 



AL Attfcma 

AM Armenia 

AT fciatria 

AU Australia 

AZ Azerbai^n 

BA Bosnia &DdH( 

BB Barbados 



pages of pamphlets publishing international applications under the PCT, 



BF BnrttnaFaso 

BG Bulgaria 

BJ Benin 

BR Bran! 

BY Bdam 

CA Canada 

CF Central African Repcblic 

CC Congo 

CH Switzerland 

CI CotedTvoire 

CM Cameroon 

CN China 

CU Cuba 

C2 Czech RcpobHc 

DE German/ 

DK Denmark 



B8 Spam 

H inland 

Fit IW 

CA Gabon 

GB Ifekedtoogdoro 

CE Georgia 

GH Ghana 

CN Guinea 

GR Greece 

HU Hungary 

IE Ireland 

IL brae! 

IS Iceland 

IT Italy 

JP Japan 

KB Kenya 

KG Kyrgyzstan 

KP Democratic People's 

Republic of Korea 

KR RepabBc of Korea 

KZ Kazakstan 

LC Saint Loch 

U Ucchtewtem 

LK Sri Lanka 

LR Liberia 



u 


Lesotho 


vi 


L&hopiia 


w 


|merabomg 


LV 


Latvia 


MC 


Monaco 


MD 


RepnbBc of Moldova 


MG 


Madagascar 


MK 


Thefianner Yngcalav 




RepnbBc of Macedonia 


ML. 


Mafi 


MN 


Mongolia 


MR 


Manrftansa 


MW 


Malawi 


MX 


Mexico 


NB 


Niger 


NL 


rteoienanos 


NO 


Norway 


NZ 


New Zealand 


PL 


Poland 


PT 


PcftQgaS 


RO 


Romania 


RO 


Russian Federation 


SD 


Sudan 


SB 


Sweden 


SG 


Singapore 



St Stoveum 

SN Senegal 

SZ Swaziland 

I'D Chad 

TC Togo 
TJ 
TM 

TR Turkey 

TT Trinidad and Tobago 

UA Ukraine 

UG Uganda 

US United States of America 

UZ Uzbekistan 

VN Viet Nam 

YU Yngosiavia 

ZW Zhnbabwe 



WO 00/34317 



-1- 



PCT/GB99/04119 



10 



15 



20 



25 



MODIFYING PROTEIN IMMUNOGENICS 

rpHE PRESENT INVENTION relates to proteins to be adrninistered especially* 
X humans particularly for therapeutic use but also for use in diagnostic tests. The 
invention particularly provides for proteins which are modified to be less 
immunogenic than the unmodified counterpart when used in vivo. 

The invention particularly addresses the clinical need for a process by which the 
natural tendency of the host to mount an immune response against an administered 
protein is substantially reduced or eliminated. There are several examples where the 
administration of protein molecules offers therapeutic benefit However the benefit is 
greatly reduced particularly where multiple doses of the therapeutic protein are 
required as the recipient immune system recognises and accelerates the elimination of 
the incoming therapeutic protein. 

There are a number of therapeutic proteins whose therapeutic use is curtailed on 
account of their immunogenicity in man. For example, when murine antibodies are 
adrninistered to patients who are not immunosuppressed, a majority of patients mount 
an immune reaction to the foreign material by making human anti-murine antibodies 
(HAMA). There are two serious consequences. First, the patient's anti-murine 
antibody may bind and clear the therapeutic antibody or immunoconjugate before it 
haSachanceto bind to the tumour and perform its function. Second, the patient may 
develop an allergic sensitivity to the murine antibody and be at risk of anaphylactic 
shock upon any future exposure to murine immunoglobulin. 

Several techniques have been employed to address the HAMA problem and thus 
enable the use in humans of therapeutic monoclonal antibodies (see, for example WO- 
A-8909622, EP-A-0239400, EP-A-0438310, WO-A-9109967). A common aspect of 
these methodologies has been the introduction into the therapeutic antibody, which in 
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general is of rodent origin, of significant tracts of sequence identical to that present in 
human antibody proteins. Such alterations are also usually coupled to alteration of 
particular single amino acid residues at positions considered critical to maintaining the 
ahtibody-aritigen binding interaction. For antibodies, this process is possible owing to 
the very high degree of structural (and functional) conservatism between antibody 
molecules of different species. However for potentially therapeutic proteins where no 
structural homologue may exist in the host species (e.g. human) for the therapeutic 
protein, such processes are not applicable. Furthermore, these methods have assumed 
that the general introduction of human sequence will render the re-modelled antibody 
non-immunogenic. It is known however, that certain short peptide sequences ('T cell 
epitopes") can be released during the degradation of proteins within cells and 
subsequently presented by molecules of the major histocompatability complex (MHC) 
in order to trigger the activation of T cells. For peptides presented by MHC class II, 
such activation of T cells can then give rise to an antibody response by direct 
stimulation of B cells to produce such antibodies. None of the previous methods have 
addressed the elimination or avoidance of such epitopes in the final therapeutic 
molecule as a means to reduce or eliminate the antibody response against proteins. 
Nor have previous methods considered the elimination of peptides presented by MHC 
class I which can trigger a cytotoxic T cell response leading to cell killing. Such 
killing of cells causes the release of cellular components including proteins which can, 
in turn, activate specialist cells which are highly active in protein processing and MHC 
^esentjtio* an* also to release; & w&amo^y cytolrines as 4 *sult of such 
activation. As » result, au inflam^^ 

Hie more active uptake and processing of the protein for therapeutic use thus 



The elimination of T cell epitopes from proteins has been previously disclosed (see 
WO-A-9852976) whereby such potential T cell epitopes are denned as any peptide 
within the protein sequence with the ability to bind to MHC class H molecules. Such 
30 potential epitopes are measured by any computational or physical method to establish 
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MHC binding. Implicit in the term T cell epitope" is an epitope which is recognised 
by the T cell receptor and which can at least in principle cause the activation of these 
T cells, ft is however usually understood that certain peptides which are found to bind 
to MHC class H molecules may be retained in' a protein sequence because such' 
peptides are recognised as "self within the organism into which the final protein is 
administered. 

In practice, soluble proteins introduced into autologous organisms do sometimes 
trigger an immune response resulting in development of host antibodies which bind to 
the soluble protein. One example is interferon alpha 2 to which a percentage of human 
patients make antibodies despite the fact that this protein is produced endogenously. 

The present invention is based on the discovery by the inventors that MHC-binding 
peptides within autologous proteins can trigger immune responses in live organisms to 
those proteins even when the specific protein is endogenously produced. A possible 
explanation for this is that the doses of such proteins achninistered are much higher 
than normal providing MHC-peptide formation which can activate T cells. 
Alternatively, the physiological environment into which such administered proteins 
are presented is one favourable to efficient antigen processing and peptide-MHC 
formation at higher levels than usually encountered, for example in inflammatory 



Aecordmg to a first aspect of. the invefttfcn, there is provided * method of rendering a. 
protein, or part 6f a protein, non-finmunogenic, or less immunogenic, to a given 



(a) 



determining at least part of the amino acid seouence of thp nrotAin* 



(b) identifying in the amino acid sequence one or more potential epitopes for T 
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cells ("T cell epitopes") which are found in an endogenous protein of the given 
species; and 

(c) modifying ttie amino acid sequence to eliminate at least one of the T cell 
5 epitopes identified in step (b) thereby to reduce the immunogenicity of the protein or 
part thereof when exposed to the immune system of the given species. 

The given species will usually be human. In step (c), one or more actual epitopes or 
potential epitopes may be eliminated. 

10 

Therefore, the present invention provides a modified method for creating proteins 
which trigger a reduced or absent immune response whereby one or more MHC 
binding peptides which are also found in the autologous organism's endogenous 
proteins are modified to reduce or eliminate binding to MHC molecules. Thus, in 

15 practice, no assumption is made about which peptides might be tolerated by the 
organism (unless data is available which indicates that tolerance is in place). Overall, 
the invention encompasses the creation of protein molecules with reduced or absent 
immunogenicity in live organisms where one or more peptides which bind to MHC 
molecules are eliminated from the protein molecule. As such, the invention is an 

20 improvement on the prior art which focuses only on the removal of potential T cell 
epitopes from proteins with the assumption that the organism is tolerant to self 

Proteins or peptide Sequences, if particular, the in$#tlqn includes a lange of %fii$n 

■i 

an4 no^faunjan proteins which are mollified by ^o^catioft oC one p* mors Mgft 
binding peptides within the sequences. Such MHC binding peptides can include 
25 peptides which bind to MHC class II molecules and/or peptides which bind to MHC 
class L 

The aspect of the invention described above is preferred to the alternative strategy of 
sq>arately immunising the organism with such peptides in order to induce tolerance or 
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the alternative of administering analogues or fragments of the protein which resist 
binding or uptake into cells (especially antigen-presenting cells), for example by 
elimination of the cell binding site in the protein. Tolerising the organism to the 
peptide in this way ( e.g. the use of MHC binding peptides" identified from a syngeneic 
protein which can be used to induce tolerance in the host prior to administration of the 
entire protein) would require two therapeutic molecules, one to perform the main 
protein function and the other to induce the tolerance, which would not be favoured 
from the regulatory point of view. 



Any method of identifying - in which term is included predicting - one or more 
potential T cell epitopes can be used in the invention. Acceptable methods may be 
computational or physical and include measuring of the binding of MHC-peptide 
complexes to T cell receptors, the testing of potential T cell epitopes in transgenic 
mice expressing human MHC molecules, the testing of potential T cell epitopes in 
mice reconstituted with human antigen-presenting cells and T cells in place of their 
endogenous cells, and the testing of potential T cell epitopes for stimulation of T cells 
in vitro via presentation of peptides on MHC molecules using syngeneic antigen- 
presenting cells. 



20 



25 



As well as identifying in the amino acid sequence one or more potential T cell 
epitopes which are found in an endogenous protein of the given species, a method of 
flfeinVm^nay, and tyj&ally wi% inyofcfe additionally ^ identic and ehhjhiatrog 
one or more potential ^cell epitopes which are*©* fouadin. attendogenou>prote% of 
the given species, since such epitopes are even more likely to be immunogenic. 

The present invention is based on a new concept in development of therapeutic 
proteins. It has been recognised previously that, with exogenous T cell dependent 
proteins, helper T cell epitopes are important to development of a significant immune 
response to the autologous protein. Such T cell epitopes function through the internal 
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processing of the protein releasing peptides which complex with MHC class II 
molecules and which are then presented on the surface of appropriate cells for 
potential binding to receptors on T cells. However, it is very difficult to predict which 
peptides within the protein will be processed appropriately such that MHC binding can 
occur and it is also a factor that many processed peptides will not bind to all allotypic 
variants of MHC class II molecules (MHC restriction). Furthermore, in any given 
living organism, it is difficult to predict whether a T cell response will actually be 
triggered by a given peptide presented on MHC class II because T cells may have been 
tolerised to such epitopes or may not have the repertoire of T cell receptors to bind to 
the MHC class II-peptide complex. Due to these complicating factors, it has 
previously been impractical to develop proteins with reduced or absent 
immunogenicity because of the difficulty in predicting actual T cell epitopes. The 
invention herein primarily takes the new approach of creating improved therapeutic 
proteins by removal of potential rather than actual T cell epitopes (usually defined by 
1 5 binding to MHC molecules) such that certain peptides within the molecule which are 
not actually immunogenic may be altered in addition to those immunogenic peptides. 
For a therapeutic molecule, preferably all of the potential T cell epitopes are removed 
whilst retaining activity of the protein. Preferably, this involves judicious choice of 
the amino acid substitutions enabling removal of the T cell epitopes and usually will 
involve the testing of a range of variant molecules with different amino acid 
substitutions. 



20 



ttot all identi|§d |)otential T cell epitope a#hich are found in an endogeggus protein 
need be eliminated. For example, immune responses in general are not mounted to 

25 autologous circulating proteins, such as immunoglobulins and other serum proteins 
such as serum albumin. So potential T cell epitopes which are found for example in 
germBne immunoglobulin variable region protein sequences of the given species 
(which will generally be human) may sometimes be ignored. However, an epitope 
considered as not generally available to the immune system to gain tolerance may be 

30 identified as a potential epitope for elimination according to the method of the present 
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invention. Examples of such include epitopes include those of intracellular proteins, 
such as nuclear proteins and integral membrane proteins. 

The invention therefore provides proteins Which haVe been altered to reduce their 
immunogenicity as well as a general method of altering such proteins to reduce their 
immunogenicity. A major principle of the present invention is that proteins are altered 
by identification of potential T cell epitopes and their subsequent alteration within the 
proteins in order to eliminate such potential epitopes. Optionally, epitopes recognised 
by host antibodies ("B cell epitopes") can also be removed if these can be identified 
for example with the aid of patient antisera or if the surface residues in a non- 
autologous protein can be altered to those of a related autologous protein endogenous 
to the host organism without losing all of the desired activity of the non-autologous 
molecule. 



15 After elimination of one or more T cell epitopes, the protein may be tested for desired 
activity. It may retain its foil activity, or it may retain a sufficient proportion of its 
original activity to be useful. In some circumstances, the activity may be altered either 
beneficially or at least in an acceptable way. Proteins entirely lacking in useful 
function post modification may be discarded. 



20 



25 



30 



Proteins of the present invention include those which have potential clinical use in 

admimsliatiom of me proteins. They may be non-atttol^ous or autologous, fttftels 
of the present invention include proteins with an enzymatic activity which has a 
beneficial therapeutic effect, proteins which are used to convert inactive drugs to 
active drugs within living organisms, proteins which are used to vaccinate whereby 
certain immunogenic epitopes are undesirable, proteins which perform as carriers of 
other molecules within the living organism and proteins which bind to other molecules 
within or introduced within the living organism in order to alter the biodistribution of 
the other molecules. Amongst the examples of non-autologous proteins with potential 
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benefit for in vivo human use are the following: Thrombolytic proteins streptokinase 
and staphylokinase; pro-drug activating enzymes such as asparaginase and 
carboxypeptidase G2; toxins of plant, fungal and bacterial origin such as ricin, 
saponin, pokeweed aritivfral protein, mistletoe lectin, cholera toxin, pertussis toxin, 
diphtheria toxin; bacterial phospholipase C (e.g. PE33, PE35, PE37, PE38, PE40); 
ribotoxins such as alpha-sarcin, clavin, mitogillin, restrictocin and bryodin-i and 2. 
Other significant non-autologous molecules of proven and potential therapeutic benefit 
include streptavidin, cobra venom factor, insulin, collagen, non-autologous antibodies, 
non-autologous MHC molecules and non-autologous T cell receptor molecules! 
Autologous proteins of proven and potential therapeutic value include IL-2, interferon 
a and p, granulocyte macrophage colony stimulating factor (GMCSF), granulocyte 
colony stimulating factor (GCSF), tissue plasminogen activator (t-PA), insulin, Factor 
Vm, Factor DC, erythropoietin (EPO), pituitary growth hormone and megakaryocyte 
growth and development factor (MGDF) and derivatives of these proteins produced by 
15 recombinant methods. Other examples of proteins include recombinant molecules 
derived or comprised of single or multiple domains from autologous or non- 
autologous proteins, recombinant proteins engineered to change or modify function or 
protein sequences derived from any of the above molecules. 

20 A typical protocol within the general method of the present invention comprises the 
following steps: 

I ^tefeamimnf the amino acid sequence of me protein or a part hereof (if 
modification only of a part is required); 

Identifying potential T cell epitopes within the amino acid sequence of the 
protein by any method including detennination of the binding of peptides to 
MHC molecules, determination of the binding of peptideAfflC complexes to 
the T cell receptors from the species to receive the therapeutic protein, testing 
of the protein or peptide parts thereof using transgenic animals with the MHC 



n. 



30 
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molecules of the species to receive the therapeutic protein, or testing with 
transgenic animals reconstituted with immune system cells from the species to 
receive the therapeutic protein; 

5 m. By genetic engineering or other methods for producing modified proteins, 
altering the protein to remove one or more of the potential T cell epitopes and 
producing such an altered protein for testing; 

IV. (optionally) Within step EL, altering the protein to remove one or more of the 
10 potential B cell epitopes; 

V. Testing altered proteins with one or more potential T cell epitopes (and 
optionally B cell epitopes) removed in order to identify a modified protein 
which has retained all or part of its desired activity but which has lost one or 

15 more T cell epitopes. 

Potential T-cell epitopes herein are defined as specific peptide sequences which either 
bind with reasonable efficiency to MHC class H molecules (or their equivalent in a 
non-human species), or which in the form of peptide:MHC complexes bind strongly to 
the T cell receptors from the species to receive the therapeutic protein or which, from 
previous or other studies, show the ability to stimulate T-cells via presentation on 
MHC class EL The method of the present invention recognises that an effective T cell- 

of me immune system, Such a response requires the uptake of the therapeutic 
25 (foreign) protein by antigen presenting cells (APCs). Once inside such cells, the 
protein is jMocessed and I fragments of me protein fo^nn a complex with MHC class H 
molecules and are presented at the cell surface. Should such a complex be recognised 
by binding of the T cell receptor from T-cells, such cells can be, under certain 
conditions, activated to produce stimulatory cytokines. The cytokines will eKcit 
30 differentiation of B-cells to foil antibody producing cells. In addition, such T cell 
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responses may also mediate other deleterious effects on the patient such as 
inflammation and possible allergic reaction. 

It is understood that not all peptide sequences will be delivered into the correct MHC 
5 class n cellular compartment for MHC class U binding or will be suitably released 
form a larger cellular protein for subsequent MHC class H binding. It is former 
understood that even such peptides which are presented by MHC class II on the 
surface of APCs may not elicit a T-cell response for reasons including a lack of the 
appropriate T-cell specificity, tolerance by the immune system to the particular peptide 
10 sequence or the low affinity of the MHC-peptide complex for T cell receptor. 

The present invention provides for removal of human (or other given species) potential 
T-cell epitopes from the therapeutic protein whereby the primary sequence of the 
therapeutic protein can be analysed for the presence of MHC class fl binding motifs by 

15 any suitable means. For example, a comparison may be made with databases of 
MHC-binding motifs such as, by searching the "motifs" database at the world-wide 
web site httoy/wehih.wehi.edu.anATilirpftp/ (formerly wehil.wehi.edii.aii or 
wehi T wehi T edu.au). Alternatively, MHC class H binding peptides may be identified 
using computational threading methods such as those devised by Altuvia et al. ((J. 

20 Mol. Biol. 249 244-250 (1995))). Similar methods may be used to identify and 
remove epitopes which bind to MHC class I. 

Having identffieffpotential given species ^ human) t-ce£ epitopes, these epitope^ 
are then elinrmated by alterafion of one or more amino acids, as required to eliminate 

25 the T-cell epitope. Usually mis will involve alteration of one or more amino acids 
within the T-cell epitope itself. This could involve altering an amino acid adjacent to 
the epitope in terms of the primary structure of the protein or one which is not adjacent 
in the primary structure but is adjacent in the secondary structure of the molecule. The 
usual alteration contemplated will be amino acid substitution, but in certain 

30 circumstances amino acid deletion or addition will be appropriate. In some instances, 
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selection of the appropriate amino acid substitution at any particular point in the 
therapeutic protein primary structure, can be made with reference to an homologous 
protein primary structure. This is particularly the case where a repertoire of 
homologous genes exist, as for example with immunoglobulins, or a singie 

5 homologous gene for the therapeutic protein exists, as is the case for the cobra venom 
factor protein and human complement factor C3 detailed in Example 1. However in 
other cases, there may not be an homologous protein to the therapeutic protein as is the 
case for potential therapeutics such as the thrombolytic agent staphylokinase. In such 
cases amino acids may be selected on the basis of similar size and/or charge, or more 

10 preferably and where structural data for the protein exist in combination with or with 
reference to in silico protein modelling techniques. In some instances amino acid 
substitutions may be made to prevent proteolytic cleavage of a protein and thereby 
inhibit binding of a peptide to MHC class II. Examples of protease cleavage sites 
include those reported by van Noort & van der Drift (/. Biol Chem. 264 14159-14164 

15 (1989)) and van Noort et al (Eur. J. Immunol 21 1989-1996 (1991)). Examples of 
amino acids which prevent protease digestion have been identified e.g. Kropshofer et 
al 9 I Immunol 151 4732-4742 (1993). 

In the method of the present invention, usually a number of variants of altered proteins 
20 will be produced and tested for retention of the desired activity. Where it is desirable 
to maximise the removal of potential T cell epitopes from the protein, it will be 
common tp *?reat6 aratfge of variants including sstne; witti pQtgifial f cell epitopes 
remaining ifi the molecule. If will be recognised that with certain protein molecules, it 
is difficult to radically alter the molecule and to retain full activity and so judicious use 
25 of molecular modelling of variants and the testing of a maximum number of variants is 
desirable. For molecular modelling, standard commercially available software 
packages can be used to model the protein structure using as starting point either of a 
crystal structure or a model built from homology or protein folding prediction. 
Information on parts of the protein involved in imparting the molecule with its activity 
30 will assist in modelling variants enabling best choice of altered amino acids which 
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remove a potential T cell epitope (or optional B cell epitope) whilst retaining activity. 

In the method of the present invention where a number of variants of altered proteins 
will be produced and tested for retention of the desired activity, it is desirable to have 

5 a high-throughput method for screening large numbers of variants. Such methods 
include in vivo methods for expression of gene variants in cells such as bacteria with 
rapid isolation of the proteins for example using an antibody to a conserved region of 
the proteins, or via a protein purification tag included in the protein sequence. As an 
alternative, in vitro methods such as in vitro transcription and translation can be used 

0 in some cases. Where the protein binds to another molecule, display methods such as 
phage display and ribosome display can be used in order to select out variants which 
retain binding activity. 

In the practice of the present invention, specific amino acid alterations can be 
5 conducted by recombinant DNA technology, so that the final molecule may be 
prepared by expression from a recombinant host using established methods. However, 
the use of protein chemistry or any other means of molecular alteration is not excluded 
in the practice of the invention. Whilst the present invention provides a method for 
removal of potential T cell epitopes (and optional B cell epitopes) from protein 
0 molecules, the method does not exclude testing of variant molecules produced within 
the invention for actual T cell epitope activity including testing of the protein or 



to receive Jhe therapeutic protein, or testing with transgenic animals reconstituted with 

> receive me merapeutic protein. 



S 



Whilst such methods are hot yet routine, in vitro T cell assays can be undertaken 
whereby the protein can be processed and presented on MHC class H by appropriate 
antigen-presenting cells (APCs) to syngeneic T cells. T cell responses can be 
measured by simple proliferation measurements (especially if the APCs have been 
) irradiated or otherwise treated to prevent proliferation) or by measuring specific 
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cytokine release. In order to account for different MHC class H allotypes, a range of 
in vivo assays will usually be required in order to broadly test for T cell epitopes. 
Alternatively, transgenic animals equipped with human (or the desired species) MHC 
class H molecules could be used to test for t cell epitopes especially where the host 
5 MHC class II repertoire has been removed and especially where one or more other 
host accessory molecules in APC/T cell interaction have also been replaced with 
human (or the desired species) such as CD4 on T cells. 

In a further aspect, the invention provides molecules resulting from method as 
10 described above. Such molecules may be useful for treating or preventing a disease or 
condition. The invention also extends to the use of such molecules in in vivo and in 
vitro diagnosis and for human or veterinary use. Preferred features of each aspect of 
the invention are as for each other aspect, mutatis mutandis. 

15 The invention is illustrated but not limited by the following examples. The examples 
refer to the following figures: 

FIGURE 1 shows the protein sequence of mature cobra venom factor, 

20 FIGURE 2 shows the protein sequence of an altered cobra venom factor - 

chain; 

FIGURE 3 shots the protein twquence of an alteted cobra venom fitctpr - 
chain; 

FIGURE 4 shows the protein sequence of an altered cobra venom factor - 
chain; 



30 



FIGURE 5 shows the protein sequence of streptokinase from Streptococcus 
eqmsimilis-y 
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FIGURE 6 shows the protein sequence of an altered streptokinase molecule; 

FIGURE 7 shows the protein sequence of mature staphylokinase; 

FIGURE 8 shows theprotein sequence of an altered staphylokinase molecule; 

FIGURE 9 shows the protein sequence of wild-type bryodin 1 ; 

FIGURE 10 shows the protein sequence of an altered bryodin 1 molecule; 

FIGURE 11 shows the protein sequence of mature human interferon 2 ; and 

FIGURE 12 shows the protein sequence of an altered human interferon 2 
15 molecule. 

Example 1 

In the present example the cobra venom factor (CVF) protein is analysed for the 
presence of potential MHC binding motifs and a method disclosed for the removal of a 
20 number of these from the molecule. 

€VE fe the no^xife complementing gtycoprotep & cobra venom, Th* 
mature CVF protein consist* of three polypepfide chains % JJ and>, produced by post 
traiislanoiialproi^ssingof a pre-pro pretein of approximate molecular weight 150kT*a 
25 (Fritzinger D.C. et al., Proa Nat'l. Acad. Set USA 91 12775-12779 (1994)). The 
aandpchaiiwarebofolmkedtofoeych^ In addition, 

both the a chain and the y chain have single intra chain disulpbide bonds whilst the p 
chain has a complex arrangement of six further internal disulphide bonds. The mature 
protein has four potential sites for N-glycosylation of which only three are used 
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(Vogel, C.W & Muller-Eberhard J. Immunol 18 125-133 (1984)). 

In the preferred embodiment of the present invention, production of the reduced 
oncogenicity or altered version of the therapeutic protein is by recombinant D*A 
technology. For the present example, it is understood that the expedient of 
recombmant production of the mature CVF protein alone is likely to provide a 
considerable reduction in immunogenicity of the protein. This assertion stems from 
the knowledge that a degree of the immunogenicity arising fro m administration of 
CVF purified from Naja naja venom, arises from antibody responses to the particular 
pattern of glycosylation not encountered in mammalian proteins (Gowda, DCetal J. 
Immunol. 152 2977-2986 (1994)). It is also understood that deglycosylated and aiso 
sialylated CVF that would arise from recombinant production of CVF, for example in 
a mammalian producer cell system, have equal functional activity with native mature 
CVF (Gowda, D.C et al. (1994) ibid.). 

In humans CVF triggers the alternative (properdin) pathway of complement activation. 
CVF forms a bimolecular complex with factor B which is cleaved by factor D into Bb 
which remains bound to CVF to form the C3/C5 convertase CVF,Bb. This complex is 
funcnonally and structurally analogous to the human C3/C5 convertase formed during 
the activation of the alternative pathway. The clinical importance of CVF lies in the 
nigh relative stability of the CVF,Bb complex versus the native C3/C5convertase (t m 
of 

disrupted wd hence down regulated by factors H and I (Gowda, D.C et <^ (&94) 
The net result of CW adtninistration therefore is to lead to the depletion of 
complement in human serum by continuous efficient and uncontrolled complement 
activatioiL 

Tbis property has been exploited as a research tool for in vivo and in vitro de- 
complementation (Cochrane C.Q et al, J. Immunol 105 55 (1970)) and has also been 
exploited for the selective elimination of tumour cells as a conjugate with a tumour 
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targeting monoclonal antibody (Vogel, C.-W. & Muller-Eberhard H.J. Proc. Natl 
Acad. Sci. USA 78 7707 (1981)). Availability of a non-immunogenic CVF would have 
considerable importance as a potential therapeutic agent for the depletion of 
complement in the plasma of patients undergoing organ xenotransplantation or in the 
construction of second generation antibody conjugates for targeted activation of 
complement in cancer patients. 



Method for identification of potential MHC class II binding motifs in cobra 
Venom Factor: 

10 The sequence of CVF was identified from the GenBank database. The sequence with 
accession number U09969 was used throughout. The protein sequences of the mature 
a, p and y chains were identified (Figure 1) and analysed individually for the presence 
of potential MHC class II binding motifs. Analysis was conducted by computer aided 
comparison to a database of MHC-binding motifs as resident on world wide web site 

15 http://wehih.wehi.edu.au/mhcpRp/ 

Results of the "searching" process on the a chain, indicate the presence of 638 
potential MHC class H binding motifs. Of these, 525 matched sequences identified in 
a database of human gennline immunoglobulin variable region protein sequences. 
These epitopes were not considered further on the basis that immune responses in 
general are not mounted to autologous circulating proteins such as immunoglobulins. 
Thi&fmpfies i^^logi(^ toletance'to the potential T-cell epitopes present in the 
strocturS of thfc inimunogtobulihs (and indeed the majority of human proteins): 
Epitopes presented by non-autologous proteins such as CVF which are identical or 
25 similar to motifs present in immunoglobulin proteins are likely also to be tolerated and 
in practice may be retained through the de-immunisation process. 



20 



Following subtraction of the human immunoglobulin protein germhne motifc, the 
remaining 113 potential epitopes in the a chain, were analysed individually for 
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similarity to non-inmmnoglobulin protein sequences. In practice, predicted anchor 
residues for each potential epitope were used in a consensus sequence search of human 
expressed proteins. The SwissProt and GenBank translated sequence databases were 
interrogated using commercially available software (DNAstar kadisbn, WI, USA). 
Epitopes identified in known circulating human proteins were not considered further 
and were therefore allowed to remain unchanged within the CVF a chain. An 
example of one such rejected potential epitope is given by the sequence FKPGMPY at 
positions 332-338 in the CVF a chain (numbering from first amino acid of the mature 
a chain). This sequence represents a predicted consensus binding motif for HLA- 
DRQB1*0301 with anchor residues underlined. Database searching using the 
consensus sequence FxxxMxY identifies only a single entry in a human protein sub- 
set of the SwissProt database corresponding to human serum biotinidase (SwissProt 
accession # A54362). An example of an epitope where no match to a human protein 
considered to be in the general circulation was found is provided by sequence 
YYQVGNNEIat position 501-509 in the CVF a chain. This sequence represents a 
potential epitope for presentation by HLA-DRB1*0101. Consensus sequence 
searching identifies only four human proteins containing this motif, of which three are 
nuclear proteins of differentiated tissues such as brain, and the fourth is an integral 
membrane protein. These may be considered as not generally available to the immune 
system to gain tolerance and therefore identify this as a potential epitope for 
eliinination according to the method of the present invention. Similarly, a further 
potaMfel IHA«)RU,indmg motif was identified & the d$haift Peptide sequence 
XYWaVESfe at positions &M9. This motif identifies a single toman nuclear 
protein DNA polymerase alpha (GenBank accession # M64481), in the same data set 
and was also therefore identified for modification by the method of the present 
invention. Following these processes, a total of 15 potential epitopes in the a chain 
were considered for removal by amino acid substitution. 

Results of the "searching" process on the p chain, indicate the presence of 366 
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potential MHC class II binding motife. Of these, 281 matched sequences identified in 
a database of human gennline immunoglobulin variable region protein sequences and 
were not considered further. Following a second round of subtractions using a 
database of human serum protein sequences (as above), 18 potential epitopes were 
5 considered for removal. 



Results of the "searching" process on the y chain, indicate the presence of 267 
potential MHC class H binding motife. Of these, 219 matched sequences identified in 
a database of human germline immunoglobulin variable region protein sequences and 
10 were not considered further. Following a second round of subtractions using a data 
base of human serum protein sequences (as above), 9 potential epitopes were 
considered for removal. 

The net result of these processes was to identify those residues within the CVF 
15 molecule which should be altered to eliminate potential MHC class II binding motifs. 
Individual amino acids within the predicted binding motifs were selected for 
alteration. With the object of maximising the likelihood of maintaining protein 
functional activity, in all cases conservative amino acid substitutions were chosen at 
any given site. In many cases, individual amino acid or even short strings (<5 
20 consecutive residues) were substituted by reference to the published sequence of 
human complement factor 3 (C3) (GenBank accession # K02765) which shows areas 
of sttgng homotegyi including iegi§nro| idem% wifh CVF. In tfoo instances- 
potential epitopes were eliminated from the p chain by insertion of additional amino 
acids. Amino acids were selected with reference to the human C3 protein. 

25 

Altered CVF a chain, p chain and y chain sequences were compiled (Figures 2 to 4) 
and further analysed by database comparison, as previously, for confirmation of 
successful elimination of potential MHC class U binding motifs. 
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Method for construction of altered CVF molecules: 

PCR primers CVF 1 for, 5 ' ATAAGAATGCGGCCGCATGGAGAGGATGGCTCTCT 
and CVFlrev; 5'ATAAGAATGCGGCCGCTATCATTGATTCTTCTGAAC were 
used to amplify a 4985bp frajgment from a Xgtl l cobra venom gland cDNA library 
5 (Fritzinger DC et a/., ibid.). The PCR was conducted on a total library DNA 
preparation using high fidelity polymerase mix optimised for long distance PCR 
(Advantage PCR Kit, Clontech, Basingstoke, UK) and conditions suggested by the 
supplier. The PCR product was cloned into pcDNA3.1 (Invitrogen, Leek, The 
Netherlands) as a Notl restriction fragment using standard techniques (Sambrook J., 

10 Fritsch E.F. & Maniatis T. (eds.) in: Molecular Cloning a Laboratory Manual, Cold 
Spring Harbor Laboratory Press, NY, USA (1989)). The gene sequence was 
confirmed to be identical to database entries using commercially available reagent 
systems and instructions provided by the supplier (Amersham, Little Chalfont, UK). 
Site directed mutagenesis was conducted using synthetic oligonucleotides and the 

15 "quick-change" procedure and reagents from Stratagene (Cambridge, UK). Mutated 
(de-immunised) versions of the gene were confirmed by sequencing. Mutated CVF 
genes were transfected into CHO cells by electroporation. Stable transfectants were 
selected using G418, and clones secreting active CVF woe selected using a C 
consumption assay (Gowda D.C et al (1994) ibid; Cochrane C.G. et al (1970) ibid) 

20 Expressing clones were expanded and recombinant protean was purified from the 
culture supernatant using sequential column chromatography and methods as 




Example 2 

25 The present invention details a process whereby potentially immunogenic epitopes 
within a non-autologous protein may be identified and offers methodology whereby 
such epitopes may be eliminated. It is understood that there are a number of proven 
therapeutic proteins for which their therapeutic use is curtailed on account of their 
immunogenicity in man. In the present example the therapeutic protein streptokinase 
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is analysed for the presence of potential MHC binding motifs and a method disclosed 
for the removal of a number of these from the molecule. 

Streptokinase (SK) is a single chain protein of approximate molecular weight 47kDa 
5 that is produced by certain strains of p-haemolytic streptococci (Huang TT. et al y 
Mol Biol 2 197-205 (1989)). The protein has no inherent enzymatic activity but has 
considerable clinical importance owing to its ability to efficiently bind human 
plasminogen, potentiating its activation to plasmin and thereby promoting the 
dissolution of fibrin filaments in blood clots. Several studies have shown that SK is an 

10 effective thrombolytic agent in the treatment of coronary thrombosis, improving 
survival (ISIS-2 Collaborative Group, Lancet 2 349-360 (1988)) and preserving left 
ventricular function following myocardial infarction (ISAM Study Group, N. Engl. Jl 
Med. 314 1465-1471 (1986); Kennedy J.W. et al y Circulation 77 345-352 (1988)). 
Despite the undoubted therapeutic value of SK, the non-autologous origin of the 

15 protein is disadvantageous due to its immunogenicity in humans. The production of 
neutralising antibodies in the patient in generally limits the protein to a single use. 

Method for identification of potential MHC class H binding motifs in 
streptokinase: 

20 The sequence of streptokinase was identified from the GenBank database. The 
sequence with accession number S46536 was used throughout (Figure 5). The 
&qufcn$e was analysed tot t»^|»te^ce pf|p<^tial MI|C class if ft&dwg pbtife by 
computer *id$d comparison to a database of MHC-bin(iiirg motife as resident on world 
wide web site htto://wduh.wehi.edu.au/mhcp epA 
^25/^'^- >. " "•■ ". v ^'r-- ' •• • - •< - 

Results of the "searching" process indicate the presence of 395 potential MHC class II 
binding motife. Of these, 283 matched sequences identified in a database of human 
germline immunoglobulin variable region protein sequences. These epitopes were not 
considered further on the basis that immune responses in general are not mounted to 
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autologous circulating proteins such as immunoglobulins. This implies 
immunological tolerance to the potential T-cell epitopes present in the structure of the 
immunoglobulins (and indeed the majority of human proteins). Epitopes presented by 
non-autologdus proteins such as SK which are identical or similar to motifs present in 
5 immunoglobulin proteins are likely also to be tolerated and in practice may be retained 
through the de-immunisation process. 

Following subtraction of the human immunoglobulin protein geimline motifs, the 
remaining 112 potential epitopes were analysed individually for similarity to non- 
10 immunoglobulin protein sequences. In practice, predicted anchor residues for each 
potential epitope were used in a consensus sequence search of human expressed 
proteins. The SwissProt and GenBank translated sequence databases were 
interrogated using commercially available software (DNAstar Madison, WI, USA). 
Epitopes identified in known circulating human proteins were not considered further 
15 and were therefore allowed to remain unchanged within the SK molecule. An 
example of one such rejected potential epitope is given by the sequence LLKAIQEQL 
at positions 79-87 in the SK protein. This sequence represents a predicted consensus 
binding motif for HLA-DR1*0101 with anchor residues underlined. Database 
searching using the consensus sequence LxxAxxxxL identifies >4000 entries in a 
20 human protein sub-set of the SwissProt database, including serum albumin protein 
(SwissProt accession number P02768). An example of an epitope where no match to a 
human protein considered to % Si the genefal Gireulatfoii w§s found is provided by 
sequence XOT)YN^at position 299-305 in the SK protein. This^sequende regents 
a potential epitope for presentation by HLA-DR4*0401. Consensus sequence 
25 searching identifies < 50 human proteins containing this motif; of which many are 
intracellular proteins of differentiated tissues such as brain. These may be considered 
as not generally available to the immune system to gain tolerance and therefore 
identify this as a potential epitope for elimination according to the method of the 
present invention. Similarly, a further potential HLA-DR1*0101 binding motif was 
30 identified in the SK peptide sequence KADLLKAI at positions 76-83 of the SK 
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protein. This motif identifies < 150 human proteins in the same data set and was also 
identified for modification by the method of the present invention. 

The net result of these processes was to identify those residues within the SK molecule 
5 which should be altered to eliminate potential MHC class II binding motifs. 
Individual amino acids within the predicted binding motifs were selected for 
alteration. With the object of maximising the likelihood of maintaining protein 
functional activity, in all cases conservative amino acid substitutions were chosen at 
any given site. A new (de-immunised) SK sequence was compiled (Figure 6) and 
10 further analysed by database comparison, as previously, for confirmation of successful 
elimination of potential MHC class II binding motifs. 

Method for construction of de-immunised SK molecules: 

PGR primers SKI (5'-ggaattcatgattgctggacctgagtggctg) and SK2 (5'- 
15 tggatccttatttgtcgttagggtatc) were used to amplify the wild-type SK gene from a strain 
of Streptococcus equisimililis group C (ATCC accession number 9542). The resulting 
1233bp fragment was cloned into pUC19 as a BamHl-EcoTU restriction fragment 
using standard techniques (Sambrook J., Fritisch E.F. & Maniatis T. (eds.) in: 
Molecular Cloning a Laboratory Manual, Cold Spring Harbor Laboratory Press, NY, 



20 USA (1989)). The gene sequence was confirmed to be identical to database entries 
using commercially available reagent systems and instructions provided by the 




©sing synthetic ^gom^leotides and the "qmckHcharige" procedure and regents from 
Stratagefce UK Ltd. Mutated (de-ifomunised) versions of the gene were confirmed by 



't^-yf^^- 25 sequencing. Mutated SK genes were sub-cloned as EcoBl-BamHl fragments into the 
bacterial expression vector pEKG-3 (Estrada M.P, et al. 9 Bio/Technology 10 1138- 
1 142 (1992)) for expression of de-immunised SK. Recombinant protein was purified 
using a plasminogen affinity column according to the method of Rodriguez et al 
(Rodriguez P. et a/., Biotechniques 7 638-641 (1992)). Fibrinolytic activity was 
30 assessed using the casein/plasminogen plate technique and the in vitro clot lysis assay 
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as described by Estrada et al (Estrada et al, ibid.). 
Example 3 

' In the present example staphylokinase is analysed for the presence of potential MHC 
5 binding motife and a method disclosed for the removal of a number of these from the 
molecule. 

Staphylokinase protein from Staphylococcus aureus has recognised and well 
characterised profibrinolytic properties. Recombinant forms of the protein have been 

10 previously produced and in vitro and in vivo studies have indicated that the protein 
holds considerable promise for thrombolytic therapy (Sako, T., Eur. J. Biochem. 149 
557-563 (1985); Schlott, B. et al., Biotechnology 12 185-189 (1994); Collen, D. et al. y 
Circulation 94 197-462 (1996)). However, clinical use in humans has been limited 
due to the demonstrated immunogenicity of the protein in man (Collen, D. et al. y 

15 Circulation 95 463-472 (1997)). Availability of a non-immunogenic staphylokinase 
would have considerable importance as a potential agent for thrombolytic therapy. 

The mature staphylokinase protein consists of a single polypeptide chain of 137 amino 
acids with approximate molecular weight 15.4kDa (Silence K. et al % J Biol 
20 Chemistry, 270 27192-27198 (1995)). 

staphylokinase* 

the sequoice of ttaphylokiiiase (sakSTAR) protein as given in table 1 of Collen et al 
25 (Collen, D et of. (1996) ibid.) was used throughout The protein sequences of the 
mature staphylokinase (Figure 7) were analysed for the presence of potential MBC 
class II binding motifs. Analysis was conducted by computer aided comparison to a 
database of MHObinding motifi as resident on world wide web site 
http://wehih.weh i > edu.auAnhcpepA 

30 
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Results of the "searching" process indicate the presence of 128 potential MHC class II 
binding motifs. Of these, 91 matched sequences identified in a database of human 
germline immunoglobulin variable region protein sequences. These epitopes were not 



5 staphylokinase which are identical or similar to motifs present in immunoglobulin 
proteins are likely to be tolerated and in practice may be retained through the de- 
immunisation process. 

Following subtraction of the human immunoglobulin protein germline motifs, the 
10 remaining 37 potential epitopes were analysed individually for similarity to non- 
immunoglobulin protein sequences. In practice, predicted anchor residues for each 
potential epitope were used in a consensus sequence search of human expressed 
proteins. The SwissProt and GenBank translated sequence databases were 
interrogated using commercially available software (DNAstar Madison, WI, USA). 
15 Epitopes identified in known circulating human proteins were not considered further 
and were therefore allowed to remain unchanged within the protein. 

The net result of these processes was to identify those residues within the 
staphylokinase molecule which should be altered to eliminate potential MHC class II 
20 binding motifs. Individual amino acids within the predicted binding motifs were 
selected for alteration. With the object of maximising the likelihood of maintaining 



considered 'further. Epitopes presented by non-autologous proteins such as 




chosen at any given site. 



25 An altered st^hylokinase sequence was compiled (Figure 8) and further analysed by 
database comparison, as previously, for confirmation of successful elimination of 
potential MHC class II binding motifs. 



Method for construction of altered staphylokinase molecules: 
30 A wild-type staphylokinase gene was synthesised under contract by Genosys 
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Biotechnologies Ltd (Cambridge, UK). The gene was constructed by PCR using long 
(80-mer) overlapping synthetic primers and the sequence as given by Collen, D et al 
(Collen, D. et al, (1996) ibid) The synthetic gene was cloned as a 453 bp £coRI- 
HihdJJl restriction fragment into bacterial expression vector pMEX (MoBifec, 

5 Gottingen, Germany). The gene sequence was confirmed to be identical to database 
entries using commercially available reagent systems and instructions provided by the 
supplier (Amersham, Little Chalfont, UK). Altered (reduced immunogenicity) 
versions of the gene were engineered using site directed mutagenesis of the wild-type 
gene in pMEX. Short (18-mer) synthetic oligonucleotides and the "quick-change" 

10 procedure and reagents from Stratagene (Cambridge, UK) were used to create the 
variant genes. All variant gene sequences wore confirmed by DNA sequencing. 

Mutated staphylokinase genes were transformed into E. coli strain TGI by standard 
techniques. A single transformed clone was selected and clones secreting active 
15 staphylokinase were selected using a fibrin plate assay (Astrup, T. et al, Arch 
Biochem. Biophys. 40: 346-351 (1952); Collen, D. et al, Fibrinolysis 6: 203-213 
(1992)). The best expressing clone was grown up and recombinant protein was 
purified from the culture supernatant using sequential column chromatography and 
methods as described previously (Collen, D. et al, (1992) ibid; Schlott et al, ibid). 



20 



25 



Example 4 

Jgi $h$ pns&lt example %ryo$n I is analysed for&e 
binding motifs and a method disclosed for file removal bf a number of thesS from the 
molecule. 



The gene for bryodin 1 protein has recently been cloned from Bryonia dionia, a 
member of the Cucurbitaceae family of plants (Gawlak, S. et al, Biochemistry 36 
3095-3103 (1997)). Bryodin 1 is a type 1 ribosome inactivating protein. Studies using 
recombinant forms of the protein have indicated that bryodin 1 holds considerable 
30 promise for immunotoxin therapy for cancer and other diseases (Gawlak, S. et al n 
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(1997) ibid.). However, clinical use in humans as with other immunotoxin agents, is 
likely to be curtailed due to immunogenicity of the protein in man. Availability of a 
non-immunogenic bryodin would have considerable importance as a potential 
component in immunotoxin based therapies. 

5 

The mature bryodin protein consists of a single polypeptide chain of 267 amino acids 
with approximate molecular weight 29kDa. The wild-type sequence is illustrated in 
Figure 9. 

10 Method for identification of potential MHC class II binding motifs in bryodin 1: 

The sequence of bryodin 1 protein as given by Gawlak et al. $ (Gawlak, S. et aL, (1997) 
ibid.) was used throughout. The protein sequence of the mature bryodin 1 (Figure 9) 
was analysed for the presence of potential MHC class II binding motifs. Analysis was 
conducted by computer aided comparison to a database of MHC-binding motife as 
15 resident on world wide web site http://wehih.wehi.edu.au/mhcpep/ . 

Results of the "searching" process indicate the presence of 315 potential MHC class II 
binding motife. Of these, 259 matched sequences identified in a database of human 
germline immunoglobulin variable region protein sequences. These epitopes were not 
20 considered further. 

Following su&fraction of th£ fcumaa immunoglobulin prot*& geimlme motifs, the 
remainfag 56 potential epitopes were analysed individually for si&iiarity to non- 
immunoglobulin protein sequences. The predicted anchor residues for each potential 
25 epitope wore used in a consensus sequence search of human expressed proteins. The 
SwissProt and GenBank translated sequence databases were interrogated using 
commercially available software (DNAstar Madison, WI, USA). Epitopes identified 
in known circulating human proteins woe not considered further and were therefore 
allowed to remain unchanged within the protein- 



30 
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The net result of these processes was to identify those residues within the bryodin 1 
molecule which should be altered to eliminate potential MHC class II binding motifs. 
For biyodin 1 protein* 13 potential epitopes ware identified for elimination from the 
molecule. Individual amino acids within the predicted binding motifs were selected 
5 for alteration. With the object of maximising the likelihood of maintaining protein 
functional activity, in all cases conservative amino acid substitutions were chosen at 
any given site. 

An altered biyodin 1 sequence was compiled (Figure 10) and further analysed by 
10 database comparison, as previously, for confirmation of successful elimination of 
potential MHC class II binding motifs. 

Method for construction of altered bryodin 1 molecules: 

A wild-type bryodin 1 gene was synthesised under contract by Genosys 
15 Biotechnologies Ltd (Cambridge, UK). The gene was constructed by PCR using long 
(80-mer) overlapping synthetic primers and the sequence as given by Gawlak et aL, 
(Gawlak, S. et a/., (1997) ibid.). The synthetic gene was cloned as a 843bp Afcol- 
EcdRL restriction fragment into a modified version of bacterial expression vector 
pET22b4- (Novagen, Madison, USA). The vector was modified to remove the peDB 
20 leader sequence which was previously shown to impede efficient expression of the 
biyodin 1 protein (Gawlak, S. et a/., (1997) ibid.). The modification was conducted by 




sequence. Elements of non-peB sequence including the ribosome binding site 
sequence an<f the Ncol site, were restored in the vector by ligation of a linker molecule 



25 to the XbaUNcol free ends in the vector. The linker was formed by annealing 
complementary oligonucleotides: 



L 1 f (5 ' -ctagaaataattttgtttaactttaagaaggagatatacatatgcc) and 
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Oligonucleotides were supplied with phosphoiylated ends by Genosys 
Biotechnologies Ltd (Cambridge, UK). Restriction digests, DNA purification, ligation 
reactions etc were conducted using standard procedures and conditions recommended 
by the Tegent suppliers. Altered (reduced immuriogenicity) versions of the gene were 
5 engineered using site directed mutagenesis of the wild-type gene in the modified 
pET22b vector. Short (18-mer) synthetic oligonucleotides and the "quick-change" 
procedure and reagents from Stratagene (Cambridge, UK) were used to create the 
variant genes. All variant gene sequences were confirmed by DNA sequencing. 

10 Wild-type and mutated bryodin I gene variants were transformed into E. co/rstrain 
TGI by standard techniques. A single transformed clone was selected for each gene 
and this clone used for sequence analysis. For expression work, bryodin 1 genes were 
transformed into R coli strain BL21(XDE3) obtained from the ATCC. Recombinant 
bryodin 1 and variant bryodin 1 proteins were purified using methods described 

15 previously (Gawlak, S. et a/., (1997) ibid). Following re-folding of the crude protein 
from inclusion bodies, purified bryodin 1 and variants were obtained > 95% pure using 
CM-Sepharose chromatography as previously (Gawlak, S. et a/., (1997) ibid). 
Activity of the recombinant proteins was assessed using a cell-free protein synthesis 
inhibition assay and methods according to Siegall et a/., (Siegall, GB. et al, 

20 Bioconjugate Chem. 5 423-429(1994)). 

ffi (he present example the human interferon a2 protein is analysed for the presence of 
potential MHC binding motift and a method disclosed for the removal of i number of 
25 these from the molecule. 



Interferon alpha 2 (INA2) is an important glycoprotein cytokine expressed by 
activated macrophages. The protein has antiviral activity and stimulates the 
production of at least two enzymes; a protein kinase and an oligoadenylate synthetase, 
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on binding to the interferon alpha receptor in expressing cells. The mature INA2 
protein is single polypeptide of 165 amino acids produced by post-translational 
processing of a 188 amino acid precursor protein by cleavage of a 23 amino acid 
signal sequence from the amino terminus. 

5 

The protein has considerable clinical importance as a broad spectrum anti-viral, anti- 
proliferative and immunomodulating agent. Recombinant and other preparations of 
INA2 have been used therapeutically in a variety of cancer and viral indications in 
man (reviewed in Sen, G.G. and Lengyel P, J. Biol Chem. 267 5017-5020 (1992)). 

10 However despite very significant therapeutic benefit to large numbers of patients, 
resistance to therapy in certain patients has been documented and one important 
mechanism of resistance has been shown to be the development of neutralising 
antibodies detectable in the serum of treated patients (Quesada, J.R. et al., J. Clin. 
Oncology 31522-1528 (1985); Stein R.G. et al. y New Eng. J. Med. 318 1409-1413 

15 (1988); Russo, D. et ai, Br. J. Haem. 94 300-305 (1996); Brooks M.G. et al 9 Gut 30 
1116-1122 (1989)). An immune response in these patients is mounted to the 
therapeutic interferon despite the fact that a molecule of at least identical primary 
structure is produced endogenously in man. In the present example, an interferon 
alpha 2 molecule with reduced potential immunogenicity in man is presented. 

20 

Method for identification of potential MHC class II binding motifs in human 
interferon alpha 2? 

The sequence of INf2 wa$ identified firoin the GenBank database. The sequence with 
accession number P01563 was used throughout The protein sequences of the mature 

25 INF2 protein was identified and the sequence excluding the first 23 amino acid signal 
peptide analysed for the presence of potential MHC class II binding motife (Figure 
11). Analysis was conducted by computer using MPT verl.O software (Biovation, 
Aberdeen, UK). This software package conducts "peptide threading* 1 according to the 
methods disclosed in WO-A-9859244. The software is able to provide an index of 

30 potential peptide binding to 1 8 different MHC class II DR alleles covering greater than 
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96% of the HLA-DR allotypes extant in the human population. 

Results of the "peptide threading" process on the INF2, indicate the presence of a total 
of 18 individual potential epitopes. The epitopes map to five distinct clusters of 
5 overlapping epitopes encompassing residues 7-40 (cluster 1), residues 45-70 (cluster 
2), residues 79-103 (cluster 3), residues 108-132 (cluster 4) and residues 140-163 
(cluster 5). Each of the five clusters contain 5, 3, 4, 3 and 3 potential epitopes 
respectively. 

10 hi order to prioritise epitopes for removal, the epitope clusters were then mapped to 
the known structure-function features on the molecule. For INF2 evidence from 
homology modelling (Murgolo N.J. et al, Proteins: Structure, Function & Genetics 
17 62-74 (1993)), site directed mutagenesis (Tymms, M J. et al, Antiviral Res. 12 37- 
48 (1989); Mclnnes B. et al, J. Interferon Res. 9 305-314 (1989)), cross-species 

15 chimaeric molecules (Ra, N.B.K. et al, J. Biol Chem. 263 8943-8952 (1988); 
Shafferman, A. et al. J. Biol. Chem. 262 6227-6237 (1987)), deletion mutants (Wetzel, 
JL et al, In: Interferons New York Academic Press, pp819-823 (1982)) and 
serological mapping studies (Lydon N.B. et al., Biochemistry 1A 4131-4141 (1985); 
Trotta PJP. et al^. In: The Interferon System Dianzani F & Rossi GJ3 (eds.) New York, 

20 Raven Press, 1985 pp231-235), suggest epitopes in clusters 1 and 2 are the highest 
priority targets for removal With reference to the structural model of Murgolo et al 
Wm$® ##®993}i&wi), #$ter t Sm&Z 5 peiipiiaj epitopes ^pi^^s 
the functionally important helix A and the AB surface loop region of the INF2 
molecule. This region is important for the antiviral activity and is involved in binding 

25 to the human INF2 receptor structure. Most significantly, epitopes for neutralising 
antibodies have been mapped to this region, specifically residues 10-11 of the helix A 
structural element (Lydon NJB. et al, (1985) ibid). 

On this basis, cluster 1 epitopes at positions 7-19 and 13-25 were eliminated by 
30 substitution of threonine for leucine at position 15 (L15T using single letter codes). 
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This procedure was conducted interactively in silico using the MPTverl package. 
Similarly, cluster 1 epitope at position 28-41 was eliminated by substitution F27Y. 
This latter substitution had the concomitant effect of reducing the number of potential 
binding alleles for the overlapping epitope at position 22-34 from 13 to 11. 

5 

Cluster 2 epitopes extend from the AB loop through the helix B region and into the 
BC loop domain. Neutralising antibodies have been shown to bind in the BC loop 
(Lydon KB. et al y (1985) ibid; Trotta P.P. et a/.,. (1985) ibid. ) therefore cluster 2 
epitope at position 58-70 was also targeted for removal. Substitution N^L reduces 
10 potential immunogenicity in this region by eliminating 3 overlapping epitopes 
encompassing positions 61-77, which collectively are predicted to bind 5 different 
MHC DR alleles. Additionally, this substitution also reduces the number of different 
binding alleles at epitope 58-70 from 5 to 3. 



15 Other epitope clusters (e.g. 3-5) map to either buried regions of the molecule or 
surface regions not involved in receptor binding and hence provide antigenic sites to 
which antibody responses are laigely non neutralising. 



20 



25 



Using the above method, an altered INF2 protein sequence was compiled containing 3 
substitutions from the starting sequence and is depicted in Figure 12. This sequence is 
gregistetf signified tfess #imujiogfc# wi$ respect t*%iman »J1C dass E[ 
presentation. The areas of reduced immunogenicity focussed to regions of the 
molecule where antibody mediated neutralisation of the protein has been shown to 
occur with the potential to limit the clinical efficacy of the molecule as a therapeutic 
entity. 



30 



Method for construction of altered interferon alpha 2 molecule: 

A wild-type INA2 gene was synthesised under contract by Genosys Biotechnologies 

Ltd (Cambridge, UK). The gene was constructed by PCR using long (80-mer) 
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overlapping synthetic DNA primers and the sequence given in GenBank accession 
number M29883. The synthetic gene was cloned as a 520bp EcoRI-Hin<HIi restriction 
fragment into bacterial expression vector pMEX8 (MoBiTec, Gottingen, Germany). 
The gene sequence was confirmed to be identical to the desired gene using 
5 commercially available reagent systems and instructions provided by the supplier 
(Amersham, Little Chalfont, UK). 



Altered (reduced immunogenicity) versions of the protein were constructed by site 
directed mutagenesis of the wild-type gene in pMEX8. Mutagenesis was conducted 
using short (18-mer) synthetic oligonucleotides obtained commercially (Genosys, 
Cambridge, UK) and the "quick-change" procedure and reagents from Stratagene 
(Cambridge, UK). Following site directed mutagenesis, DNA sequences of selected 
clones were confirmed as previously. 



15 For expression of recombinant wild-type and recombinant variant INA2 proteins, 
pMEX8-INA2 expression plasmids were transformed into K coli strain JA221 and 
cells grown and harvested using standard procedures (Sambrook J., Fritisch E.F. & 
Maniatis T. (1989) ibid). Recombinant INA2 was prepared essentially as described 
previously (Grosfeld, H. ei al., in Advances in Biotechnological Processes 4, Mizrahi 

20 A. & Van Wezel A. L. eds., pp59-78 Alan R. Liss Inc, New York (1985)) with minor 
modifications. Briefly, following high speed centrifogation, the supernatant was 
c^ei^tei^y 60% satiation Of ammonium sulphate ariitten chioni#og^hed on 
a 2.7 x 68cm Sephadex G-75 column equilibrated with PBS plus 0.5M NaCL 
Fractions (8ml) Were collected at a lmin/ml flow rate. Pooled fractions were further 

25 purified using an immunoaffinity column as described previously (Grosfeld, H. 6/ a/., 
(1985) ibid). Purified proteins were assayed using SDS-PAGE analysis and 
functional activity (anti-viral activity) determined using biological assays as described 
previously (Shafferman, A. et al, J. Biol Chenu 262 6227-6237 (1987)). 
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CLAIMS 

. 1 . . A method of rendering aprotein, or part of a protein^ non-immunogenic, or less 
immunogenic, to a given species, the method comprising: 

5 

(a) determining at least part of the amino acid sequence of the protein; 

(b) identifying in the amino acid sequence one or more potential epitopes for T 
cells ("T cell epitopes'') which are found in an endogenous protein of the given 

10 species; and 

(c) modifying the amino acid sequence to eliminate at least one of the T cell 
epitopes identified in step (b) thereby to reduce the immunogenicity of the protein or 
part thereof when exposed to the immune system of the given species. 



15 



20 



2. A method as claimed in claim 1 wherein the given species is human. 

3. A method according to claim 1 or 2, wherein the amino acid sequence is 
modified by modification of a peptide which binds to MHC class II molecules. 

4. A method according to claim 1 c# 2, wherein the mm&o acid 
ion of a peptide which bitfds to MHC cla$$ I molecules. 



5. A method as claimed in any one of claims 1 to 4, wherein the T cell epitopes 
25 axe identified by computation. 

6. A method as claimed in any one of claims 1 to 5, comprising additionally 
identifying and eliminating one or more potential T cell epitopes which are not found 
in an endogenous protein of the given species. 
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7. A method as claimed in any one of claims 1 to 6, wherein an epitope 
considered as not generally available to the immune system to gain tolerance is 
identified as a potential epitope for elimination. 

5 

8. A method as claimed in claim 7, wherein an epitope of an intracellular protein 
is identified as a potential epitope for elimination. 

9. A method as claimed in any one of claims 1 to 8, wherein one or more B cell 
10 epitopes is also identified as a potential epitope for elimination. 

10. A method as claimed in any one of claims 1 to 9, wherein the protein is non- 
autologous. 

15 11. A method as claimed in any one of claims 1 to 9, wherein the protein is 
autologous. 

12. A method as claimed in any one of claims 1 to It; wherein the protein is: a 
protein with an enzymatic activity which has a beneficial therapeutic effect; a protein 
20 which is used to convert inactive drugs to active drugs within a living organism; a 
protein which is used to vaccinate whereby one or more immunogenic epitope is 

organisfa; of a protein which binds to other molecules within or introduced within the 
lining organism in drier to altar the biodistribution of the other molecules. 
25 13. A method as claimed in any one of claims 1 to 12, wherein the protein is tested 
for desired activity. 

14. A method as claimed in claim 1 comprising: 



30 
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20 



30 



L Determining the amino acid sequence of the protein or a part thereof (if 
modification only of a part is required); 

II, Identifying potential T cell epitopes within the- amino acid sequence of the 
protein by any method including determination of the binding of peptides to* 
MHC molecules, determination of the binding of peptide:MHC complexes to 
the T cell receptors from the species to receive the therapeutic protein, testing 
of the protein or peptide parts thereof using transgenic animals with the MHC 
molecules of the species to receive the therapeutic protein, or testing with 
transgenic animals reconstituted with immune system cells from the species to 
receive the therapeutic protein; 



HI. By genetic engineering or other methods for producing modified proteins, 
altering the protein to remove one or more of the potential T cell epitopes and 
15 producing such an altered protein for testing; 

IV. (optionally) Within step IIL, altering the protein to remove one or more of the 
potential B cell epitopes; 



Testing altered proteins with one or more potential T cell epitopes (and 
optionally B cell epitopes) removed in order to identify a modified protein 
which has retained all or part of its desired activity but which has lost one or 



eel 



25 15. A method as claimed in any om of claiiils 1 to 14, wherdn a T cell epitope is 
eliminated by alteration of one or more amino adds within the epitope itself 

16. A method as claimed in claim 15, wherein the alteration is amino acid 
substitution. 



17. A method as claimed in claim 16, wherein the ammo acid substitution is made 
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with reference to a homologous protein primary structure. 

18. A method as claimed in claim 16, wherein the amino, acid substitution is made 
on the basis of similar size and/or charge. 

5 

19. A method as claimed in claim 16, wherein the amino acid substitution is made 
with reference to in silico protein modelling techniques. 

20. A method as claimed in claim 13, wherein a high-throughput method is used 
10 for screening large numbers of modified proteins. 

21 . A molecule resulting from a method as claimed in any one of claims 1 to 20. 

22. A molecule as claimed in claim 21 for use in medicine or diagnosis. 

23. The use of a molecule as claimed in claim 21 in the manufacture of an 
therapeutic or diagnostic agent 



WO 00/34317 



PCT/GB99/04119 



1/4 



A 

a-chain 



ALYTLITPAVLRTDTEEQILVEAHGDSTPKQLDI FVHDFPRKQKTLFQTRVDMNPAiGGMLVTP 
TIEIPAKBVSTDSRQNQYVWQVTGPQVRL^^ 

* SMDHNTSKMNKTVIVEFQTPEGItVSSNSTO^ ' 
AYPDVRKYVLPSFEVRLQPSEKFFYIIXSNENFirVSITA^ 

SI PDSLTRI PI IDGDGKATLKRDTFRSRFPNLNELVGHTLYASVTVMTESGSDMVVTEQS^ 

IVASPYQIHFTKTPKYFKPGMPYELTVYVTNPDGSP 

NIPLNAQSLPITVRTNHGDLPRERQAT^ 

FNVKGNANSI^QI KYFTYL I LNKGKI FKVGRQPRRIX^NLVTMNLH I TPDL I PS FRFVAYYQV 
GNNErVADSVWVDVKDTCMGTLVVKGDNL I QMPGAAMKI KLEGDPGARVGLVAVDKAVYVLND 
KYKISQAKIWDTIEKSDFGCTAGSGQNNLGVFEDAGLALTTSTNLOT 



EIQMPTHKDLNLDITIELPDREVPIRYRINYENALLARTVETKLNQDITV^ 
TFYNAQLQEKANVCNKPHLNVSVENIHLNAM CTRYLGEVDSTMTI IDI SMLTG 

FLPDAEDLTRLSKGVDRYISRYEVDNNMAQKVAVIIYLNKVSHSEDECLHFKILKHFEVGFIQ 
PGSVKVYS YYNLDE KCTKFYH PDKGTGLLNKI C I GNVCRCAGETCS S LNHQER IDVPLQ I EKA 
CETNVDYVYKTKLLRIEEQIX3NDI YVMDVLEV KQGTDENPRAKTHQY I SQRKCQEALNLKVN 
DDYLIWGSRSDLLPTKDKI SYI ITKNTWIERWPHEDECQEEEFQKLCDDFAQFSYTLTEFGCP 
T 



DDNEIX3FIADSDI ISRSDFPKSWL^TKDLTO 
PTKGICVAEPYEIRVMK^FIDLOTPYSVVKNEQVEIlUVI 
^lK^J«^QQFPIiCAL9SRA^ 
K&IVTfVItoPRfta^^ 
SIPD 



B 

p-chaln 



C 

y-chain 





A = alpha chain 
B = gamma chain 
C = beta chain 
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ALYTL I TPAVLRTDTEEQI LVEAHGDSTPKQLD I FVHDFPRKQKTLFQTRVDMNPAGG>ILVTP 
TIEIPAKEVSTDSRQNQYVWQVTGTQVRLEKVV^ 

SMDHNTSKMNKTVI VBFQTPEGI LVS SNSVD SLNFFWPYNL PDLVS LGTWR I VAKYEHS PENY 
TAYFDVRKYVLPS FEVI VE PTEKFYY I DGNENFHVS I TARYLYGEBVEGVAFVLFGVKI DDAK 
I SLPESLKRI PI IDGIXjKATLSKDTFRSRFPNLNELVGKSLYVS 
HIVASPYQIHHsCTPKYtfK^ 

LNI PLNAQSLP ITVRTNHGDLPRERQATKSMTA1 AYQTQGGSGNYLHVA1TSTE I KPGDttLPV 
NFNVKGNANSLAQI KYFTYLILNKGKI FKVGRQPREPGQDLWLNLHITPDLI PSFRLVAYYT 
L I GASGNNE I VADSVWVDVKD S CVGS I WKGDNL I QM PGAAMKI KLEGDPGARVGLVAVDKAV 
FVLNDKYKISQAKIWDTVVBKADIGCT^^ 
PAN 

FIGURE 2 

Protein sequence of an altered cobra venom factor ot-chain. 



DDNEIXSFIADSDIISRSDFPKSWLWLTKDLTEEPNS^ 
PTKGICVADPFT2VTVMK\^FIDLQM 

ASTTGQR YRQQFP I KALS S RAVP FVI VPLEQGLHDVEVKAAVYHH F I SDGVRKKLKW PEGVQ 
KSTVTI VKLDPRAKGVGGTQLEVI KARKLDDRVPDTE I ETKI I IQGDPVAQI I ENS IDGSKLN 
SIPD 

FIGURE 3 

Protein sequence of an altered cobra venom factor y-chain. 



EIQMPTHKDIiOTiDITIELPDREVPi^^ 
FI^DAEBLf^^ 

PGSVKVYSY¥NLPESCT3RFYHPD^ 

ceinvdyv^ot 

dhyliwgsrsdixptkdkisyiig10)tw^^ 

T 



FIGURE 4 

Protein sequence of an altered cobra venom factor 0~chaia 
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IAGPEWLLDRPSVNNSQLWSyAGTVEGTNQDI SLKFFE IDLTSRPAHGGKTBQGLSPKS 

KPFATDSGAMPHKLEKADLLKAIQEQLIANVHSNDDYFEVIDFASDATITDRNGKVYFAD 

KDGSVTLPTQPVQEFLLSGHVRVRPYKBKP I QNQAKSVDVBYTVQFTPLNPDDDFRPGLK 

DTKLLKTIiAJGDTITSQELLAQAQSILNKTHP 

FTYHVKNREQAYEINKKSGLNEEINiniDLISBKYY^ 

DVNTNELLK^EQLLTASERNLDFtoLYDPRDKAK^ 

TNRI ITVYMGKRPEGENAS YHIiAYDKDRYTEEEREVYSYLRYTGTPI PDNPNDK 



Protein sequence of streptokinase from Streptococcus equisimilis 



I AGPEWLLDRPSVNNSQLVVSVAGTVEGTNQDI SLKFFE IDLTSRPAHGGKTBQGLSPKS 
KPFATDSGAMPHKLEKADLLKAKQEQLIANVHSNDDYFEVIDFASDATI 
KDGSVTLPTQPVQEFLLSGHVRVRPYKE KP I QNQAKSVDVEYTVQFTPLNPDDDFRPGLK 
DTKLLKTLAIGDTITSQELLAQAQSILNKTHPGYT^ 

FTYHVKNREQAYE INKKSGLNEB INNTDL I SEKYYVLKKGEKPYDPFDRSHLKLFTIKFV 
DWTNELLKSEQLLTASERNLDFRDLYDPRDKAKLLYNNLDAFGIMDYTLTGKVEDNHDD 
TNRI ITVYMGKRPEGENAS YHLAYDKDRYTEEEREVYS YLRYTGTPI PDNPNDK 



Protein sequence of an altered streptokinase molecule. 



SSS FDKGKYKKGDDAS YFEPTGPYIJWN^^X3VDSKGNBLLS PHAVEFPI KPGTTLTKEKIEYY 
VEWALDATAYKEFRVVELDPSAKIE\nTYDKNKKKEETKS F P I TEKGFWPDLS EHI KNPGFN 
LITKWIEKK 



SSSFDKGKHCBGDDASQFEPTGPYLMVNV^^ 

EWALDATAYAAFAVVEU)^ 

FTKWIEKK 



FIGURE 5 



FIGURE 6 



FIGURE 7 




FIGURE 8 



Protein sequence of an altered staphylokinase molecule. 
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DVS FRLSGATTTS YGWIKNUIEALPYERKVYNI PLLRS S I SGSGRYTLLHLTNYADETI SVA 
VDVTNVYI^YIAGDVS YFFNEASATBAAKFVFKDA 

LGLPALDSAITTLYYYTASSAASALLVLIQSTAESARYKF I EQQIGKRVDKTFLPSLATI SLE 
NNWS ALS KQI Q IASTNNGQFES PWLI DGNNQRVS I TNASARVVTSNIALLLNRNN IAAI GED 
I SMTLIGFEHGLYGI 

FIGURE 9 

Protein sequence of wild-type bryodin 1 . 



DVS FSMSGATTTS YGVFVKNLREALPFERKVYNI PLLRS S I SGSGRYTLLHLTNYADETI SVA 
VDVTNVYIMGYLAGDVSYFFNEASATEAAKFVFKDAKKKV^ 

LGLPALDSAITTLYYYTASSAASALLVLI qstaesarfkfieqqigkrvdktflpslati slb 

NNWSALSKQIQIASTNNGQFESPVVLVDGNNQSVSITNASARVVT^ 

I SMTLIGFEHGLYGI 

FIGURE 10 

Protein sequence of an altered bryodin 1 molecule. 



OTLPQTHSLGSRRTLMLLAQMRKISLFSCLKDIOT 

FNLFSTKDSSAA)TOETLLDKFYTELYQQLOT 

TLYLKEKKYSPCAWEVVRAEIMRSPSLSTNLQESLRS^ 

FIGURE 11 

Protein sequence of mature human interferon alpha 2 



CDLPQTHSiraraTTM^ 

FLLFSTKDSSAAWDETLLDKFYTELY^^ 

TLYLKEKKYSPCAWEVVRAEIMRSFSLSTNLQESLRSKE 



FIGURE 12 

Protein sequence of an altered human interferon alpha 2 molecule 
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